Building and Using a Lexical Knowledge Base of Near-Synonym Differences
نویسندگان
چکیده
Choosing the wrong word in a machine translation or natural language generation system can convey unwanted connotations, implications, or attitudes. The choice between near-synonyms such as error, mistake, slip, and blunder — words that share the same core meaning, but differ in their nuances — can be made only if knowledge about their differences is available. We present a method to automatically acquire a new type of lexical resource: a knowledgebase of near-synonym differences. We develop an unsupervised decision-list algorithm that learns extraction patterns from a special dictionary of synonym differences. The patterns are then used to extract knowledge from the text of the dictionary. The initial knowledge-base is later enriched with information from other machine-readable dictionaries. Information about the collocational behavior of the near-synonyms is acquired from free text. The knowledge-base is used by Xenon, a natural language generation system that shows how the new lexical resource can be used to choose the best near-synonym in specific situations.
منابع مشابه
Building a Lexical Knowledge-Base of Near-Synonym Differences
Building a Lexical Knowledge-Base of Near-Synonym Differences Diana Inkpen Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2004 Current natural language generation or machine translation systems cannot distinguish among near-synonyms—words that share the same core meaning but vary in their lexical nuances. This is due to a lack of knowledge about differences b...
متن کاملAcquiring Collocations For Lexical Choice Between Near-Synonyms
We extend a lexical knowledge-base of near-synonym differences with knowledge about their collocational behaviour. This type of knowledge is useful in the process of lexical choice between near-synonyms. We acquire collocations for the near-synonyms of interest from a corpus (only collocations with the appropriate sense and part-of-speech). For each word that collocates with a nearsynonym we us...
متن کاملExperiments on Extracting Knowledge from a Machine-Readable Dictionary of Synonym Differences (Invited Talk)
In machine translation and natural language generation, making the wrong word choice from a set of near-synonyms can be imprecise or awkward, or convey unwanted implications. Using Edmonds’s model of lexical knowledge to represent clusters of near-synonyms, our goal is to automatically derive a lexical knowledge-base from the Choose the Right Word dictionary of near-synonym discrimination. We d...
متن کاملExperiments on Extracting Knowledge from a Machine-Readable Dictionary of Synonym Differences
In machine translation and natural language generation, making the wrong word choice from a set of near-synonyms can be imprecise or awkward, or convey unwanted implications. Using Edmonds’s model of lexical knowledge to represent clusters of near-synonyms, our goal is to automatically derive a lexical knowledge-base from the Choose the Right Word dictionary of near-synonym discrimination. We d...
متن کامل‘Repetition’ in Arabic-English Translation: The case of Adrift on the Nile
Abstract This study investigates ‘repetition’ in the English translation of the Arabic Novel, Adrift on the Nile (1993). It aims to explore the communicative functions of ‘repetition’ and to see if these functions have been maintained or lost in the process of translating the Novel. In addition, it seeks to find the translation strategies used in rendering ‘repetition’. To achieve this aim, a d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computational Linguistics
دوره 32 شماره
صفحات -
تاریخ انتشار 2006